Picture for Changsheng Xu

Changsheng Xu

Disentanglement-Based Equivariant Learning for Compositional VQA

Add code
Jun 01, 2026
Viaarxiv icon

Boosting Multimodal Federated Learning via Chained Modality Optimization

Add code
Jun 01, 2026
Viaarxiv icon

Towards Domain-Generalized Open-Vocabulary Object Detection: A Progressive Domain-invariant Cross-modal Alignment Method

Add code
Mar 29, 2026
Viaarxiv icon

A Step Toward Federated Pretraining of Multimodal Large Language Models

Add code
Mar 25, 2026
Viaarxiv icon

Complementary Text-Guided Attention for Zero-Shot Adversarial Robustness

Add code
Mar 19, 2026
Viaarxiv icon

Replacing Parameters with Preferences: Federated Alignment of Heterogeneous Vision-Language Models

Add code
Jan 31, 2026
Viaarxiv icon

SoMe: A Realistic Benchmark for LLM-based Social Media Agents

Add code
Dec 09, 2025
Viaarxiv icon

LiveStar: Live Streaming Assistant for Real-World Online Video Understanding

Add code
Nov 07, 2025
Viaarxiv icon

Locality Preserving Markovian Transition for Instance Retrieval

Add code
Jun 05, 2025
Viaarxiv icon

Look Before You Leap: A GUI-Critic-R1 Model for Pre-Operative Error Diagnosis in GUI Automation

Add code
Jun 05, 2025
Viaarxiv icon